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Overview 


With watchOS 7, Apple Watch Series 3 and later use an updated algorithm to estimate a 
user's cardio fitness level as measured by VO2 max, the maximum volume of oxygen an 
individual can extract from inhaled air. This update extends VO» max estimates to lower 
ranges while also expanding availability of this metric. Additionally, with watchOS 7.2, 
users can view how their cardio fitness level is classified based on their age group and 
sex in the Health app on iPhone, and they can receive a notification if it falls within the 
low range. This paper provides a detailed understanding of the capabilities of these 
features, including testing and validation. 


Introduction 


Definition 

VO2 max is the maximum volume of oxygen an individual can extract from inhaled air and consume 
through cellular metabolism. As such, VO2 max is a good overall indicator of cardiorespiratory fitness 
(CRF) because it incorporates multiple organ systems and is influenced by a number of factors at various 
points along the pathway, from respiration to end-organ oxygen consumption.’ VO max values are 
typically normalized for body mass and reported as milliliters of oxygen per kilogram of body mass in one 
minute (ml/kg/min). They normally decline with increasing age and, at a population level, differ across 
biological sexes.2 


Measurement and Estimation 

VO2 max is measured during cardiopulmonary exercise testing (CPET) — a procedure in which an individual 
is asked to ride a stationary bike or ambulate on a treadmill at increasing intensity levels while wearing a 
face mask, which allows for the direct measurement of oxygen in inhaled and exhaled air.3 In most cases, 
the volume of oxygen that individuals consume during testing plateaus even though their work increases, 
and this plateau or VO? peak is assumed to be and referred to as the VO max, despite a lack of certainty 
that a true maximum has been achieved.4 


In practice, VO2 max or CRF is more commonly estimated from measurements taken during submaximal 
exertion because these tests are less expensive and less time intensive than maximal CPET, they require 
less exertion from and are more comfortable for the subject, and significant evidence exists to derive VO2 
max from submaximal exertion.® 

Utility 

CRF, as measured by VO2 max or the closely related metabolic equivalent (MET) — with 1 MET = 

~3.5 ml/kg/min — has been repeatedly shown over the last 30 years to be a predictor of all-cause and 
cardiovascular mortality, as well as cardiovascular events for men and women.®.7.8 In some studies, CRF 
was independent of and more predictive than well-known risk factors for cardiovascular and all-cause 
mortality such as hypertension, obesity, and hypercholesterolemia.?.10,11 


Because of this prognostic utility, members of the medical and scientific community have advocated for 
inclusion of measures of CRF in routine medical practice as adjuncts12 to or even in place of traditional 
risk models such as Framingham.13 This predictive utility also applies outside the general population to 
disease-specific cohorts, such as individuals with heart failure, 11 and in clinical decision-making related to 
specific events such as perioperative management'®.16 and referral for cardiac rehabilitation.1? In response 
to these and other demonstrations of utility, the American Heart Association (AHA) in 2016 advocated for 
the assessment of CRF on a more routine basis, making a case for fitness as a vital sign.5 


w 
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Genetics 

Genetics are strongly correlated with an individual's VO max and changes in VO2 max in response to 
exercise. At baseline, genetic factors are believed to determine approximately 50 to 70 percent of the VO» 
max differences observed between individuals18.19 and approximately 20 to 60 percent of the variation in 
VO2 max improvements in response to exercise training.*.20 


Interventions 

Improving or maintaining VO2 max over time is strongly associated with decreased mortality. In a study 

of over 500 men followed for 11 years, Laukkanen et al. found that for every 1 ml/kg/min increase 

in VO2 max, risk of death decreased by 9 percent.21 At a study level, high-intensity interval training yields 
the greatest improvements in VO2 max.22,23,24 Over the course of programs that ranged from 6 to 12 weeks, 
improvements in VO2 max were generally approximately 5 to 10 percent (in ml/kg/min). It's important to 
note that decreases in VO max with decreased activity or inactivity have been reported to be of similar or 
greater magnitude (up to a 27 percent decline) over far shorter time periods (2 to 3 weeks).25.26 Increasing 
physical activity in the absence of improvements to VO max doesn't appear to confer the same survival 
benefit in those whose VO2 max increases compared with those whose VO2 max doesn't.27 


Cardio Fitness on Apple Watch 


This paper describes the development and validation of the cardio fitness metric, an estimate of VO2 
max using Apple Watch. The intended audience of this paper is researchers, healthcare providers, 
and developers who may be interested in using this estimation in their work, as well as customers 
who would like to know more about VO2 max and how it's measured and validated using Apple Watch. 
Additional information on how to set up and view VO2max estimates for Apple Watch can be found 

at support.apple.com/en-us/HT 211856. 


Metric Description 


Cardio fitness on Apple Watch is an estimation of a user's VO2 max in ml/kg/min, made based on measuring 
a user's heart rate response to physical activity. Updates to the algorithm used to estimate VO» max in 
watchOS 7 extend estimates to lower ranges of cardio fitness (14 to 60 ml/kg/min) for users with Apple 
Watch Series 3 or later. A view of VO2 max in the Health app in iOS 14 under Cardio Fitness is shown in 
figure 1. A VO2 max value may be generated after walking, running, or hiking outdoors on relatively flat 
ground (that is, a grade of less than 5 percent incline or decline) with adequate GPS, heart rate signal 
quality, and exertion (an approximate increase of 30 percent of the range from resting heart rate to max). 

A user's first such workout won't generate an estimate, and a user needs to have worn Apple Watch for 
one day before a first estimate can be generated. 
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About Cardio Fitness 


This is a measurement of your VO, max, which 
is the maximum amount of oxygen your body 
can consume during exercise. Also called 
cardiorespiratory fitness, this is a useful 
measurement for everyone from the very fit to 


Figure 1: Cardio fitness in the Health app in iOS 14 


These estimates of VO2 max are based on submaximal predictions of VO2 max rather than peak VO. As 
such, users don't need to achieve peak heart rate to receive an estimate; however, a notion of peak heart 
rate is needed. Because of this, users who take medications that may reduce their peak heart rate can 
indicate in Health Details in the Health app that they're taking this medication to enable more accurate 
VO2 max estimates (see figure 2). 


With a feature introduced in iOS 14.3, users ages 20 and above have the option of being notified if their 
cardio fitness level, as measured by estimated VO2 max, is consistently and confidently low enough to 
suggest risk of long-term health issues or current limitation in daily activities. For users ages 20 to 59, this 
threshold for notification is the lowest quintile for sex and age by decade as determined by the Fitness 
Registry and Importance of Exercise National Database;2 for users ages 60 and above, absolute VO2max 
thresholds of 18 and 15 ml/kg/min are used for males and females, respectively, based on data suggesting 
these are thresholds for independent living at the extremes of age in both sexes.28 Users who want 

to receive Low Cardio Fitness notifications must opt in, which requires completing an onboarding 
experience in the Health app that describes the feature; collects information like age, sex, and relevant 
medications needed to provide an accurate alert; outlines factors that can lower your cardio fitness; and 
provides optional education content describing the importance of VO2max and potential causes of a 
notification (see figure 3). 
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Levels 
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your overall physical health and a predictor 


of your long-term health. 


Apple Watch can estimate your cardio 
fitness level and notify you if it's low. 


Cardio fitness can be maintained or 
improved with physical activity. 


Figure 3: Onboarding for Low Cardio Fitness notifications in the Health app in iOS 14 
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Development 


Study Design 

Apple collected data for the design and validation of the VO2 max metric across multiple studies, approved 
by an institutional review board (IRB), involving participants who consented to the collection and use of 
their data for this purpose. 


Study participants completed VO2 max tests, VO2 submax tests, or both — referred to as cardiopulmonary 
exercise testing, or CPET — while wearing Apple Watch Series 4. A variety of CPET test protocols were 
used, including both treadmill and cycle ergometers. Each participant completed up to six CPETs over the 
course of the studies, with at least 10 days between consecutive tests to ensure participants had sufficient 
time to rest between tests and adequate periods for data collection before and after individual CPETs. 
CPET data was used to verify that the protocol was completed correctly and that participants reached 

at least 60 percent of the predicted maximum heart rate. Tests for which there were a gas exchange 
abnormality, low-quality heart rate signal, detected arrhythmia, reported pain, or biomechanical inefficiency 
were excluded from all further analyses. Tests that passed the verification steps were used in algorithm 
development. To obtain the reference VO2 max of each participant, linear projections were made using 
heart rate and VO in the submax range to determine VO2 max based on age-predicted maximum heart 
rate. Predicted maximum heart rate was lowered for users on heart rate-limiting medications, such as beta 
blockers, according to published predictions.29 


In addition to wearing Apple Watch during proctored CPETs, participants wore Apple Watch and carried 
iPhone during their day-to-day activities throughout the course of the studies. These activities included 
workouts logged by participants. Data from a variety of Apple Watch sensors (photoplethysmograph, 
accelerometer, gyroscope, barometer, and GPS) was collected during this period and used in VO max 
algorithm design. 


A subset of study participants was withheld from all algorithm design data to verify algorithm accuracy 
and guard against overfitting. Algorithm performance was calculated by comparing the last valid VO2 max 
estimate on Apple Watch with the average of submaximal projections from all curated CPETs for each 
participant unless otherwise specified. 


Statistical Methods 

The validity of VO2 max on Apple Watch was computed as the mean and standard deviation of errors 
between the last valid mean VO2 max estimate on Apple Watch and the mean submaximal VO2 max 
projection from all curated CPETs for each participant. Reliability, reported as intraclass correlation 
coefficient (ICC), was evaluated by calculating the absolute agreement per participant between the last 
valid VO max estimate on Apple Watch and a previous VO2 max on Apple Watch estimated at least 28 
days prior. The consistency of VO2 max on Apple Watch is expressed as the median and 90th percentile 
standard deviation per participant of all VO2 max estimates on Apple Watch for participants who had at 
least five estimates. Finally, the availability of VO2 max on Apple Watch is computed in two ways: as the 
percentage of all outdoor pedestrian workouts longer than 5.75 minutes from all participants yielding 

a VO2 max estimate on Apple Watch, and as the percentage of participants who completed at least 

10 outdoor pedestrian workouts longer than 5.75 minutes who received at least one VO2 max estimate 

on Apple Watch after 10 workouts. 
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Results 


Baseline characteristics of the participants whose data was used for design and validation are summarized 
in table 1. 


Table 1. Participant characteristics 


Design (N= 534) Validation (N = 221) 
| Gender — number (%) 
Female 191 (36) 94 (43) 
Male 343 (64) 127 (57) 
Age — years* (mean + SD) 53 +18 55+17 
Age distribution — number (%) 
<45 years 207 (39) 74 (33) 
45-54 years 67 (13) 26 (12) 
55-65 years 57 (11) 36 (16) 
>65 years 203 (38) 85 (38) 
Reference VO2 max — ml/kg/min (mean + SD) 31.7 + 10.6 29.7 + 10.5 
Length of observation — days (mean + SD) 441 + 137 390 + 138 
Comorbidities — number (%) 
Arthritis 51 (10) 17 (8) 
Diabetes 38 (7) 23 (10) 
History of stroke 9 (2) 5 (2) 
Coronary artery disease 41 (8) 24 (11) 
History of myocardial infarction 34 (6) 16 (7) 
COPD 4 (1) 3 (1) 
Heart failure 10 (2) 5 (2) 
Hypertension 121 (22) 47 (21) 
Smoking status (cigarettes) — number (%) 
Current smoker 5 (1) 1(1) 
Former smoker 63 (12) 37 (17) 
Never smoked 300 (56) 129 (58) 
Smoking status unknown 166 (31) 54 (24) 
BMI category — number (%) 
Underweight (BMI < 18.5) 1 (<1) 2 (<1) 
Normal weight (18.5 < BMI < 25.0) 215 (40) 99 (45) 
Overweight (25.0 < BMI < 30.0) 220 (41) 77 (35) 
Obese (BMI > 30.0) 98 (18) 43 (19) 


*Based on year of birth. 
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Algorithm performance for the design and validation data sets is reported in table 2. A plot of reference 
(average CPET-derived VO2 max per user compared with final Apple Watch-estimated VO» max) for the 
design and validation participants is shown in figure 4. Algorithm performance was assessed on data 
collected during workouts. For a subset of participants (132 design and 62 validation), VO2 max was 

also estimated outside of workouts during periods of outdoor walking to assess the ability to estimate 

VO2 max when a workout hasn't been initiated on Apple Watch. In these users, nonworkout estimates were 
an average of 0.32 ml/kg/min higher than workout estimates in the design group. No significant difference 
was detected between workout and nonworkout estimates in the validation group. 


Design 


Apple Watch VO2max 


T T T T T T 


10 20 30 40 50 60 
Reference VO2max 


Validation 


Apple Watch VO2max 


T T T T T T 


10 20 30 40 50 60 


Reference VO2max 


Figure 4: Reference compared with estimated VO2 max (ml/kg/min) for participants in the design and validation sets 
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Table 2. VO2 max performance 


Metric Description Design (N = 534) Validation (N = 221) 
Validity Error (mean estimated VO2 max - mean 12+4.4 14+47 

projected submax reference) — ml/kg/min 

(mean + SD) 
Reliability ICC A-1 comparison of last VO2 max estimate 0.89 [0.86, 0.91] 0.86 [0.80, 0.90] 


using data and metadata from only that 
session to a VO2 max estimate more than 28 
days previous using only data and metadata 
from that session — ICC [confidence interval] 


Consistency SD of pvO2max per user — ml/kg/min (% of pVO2 max) 


Median 1.2 (3.7%) 1.2 (3.4%) 
90th percentile 2.6 (7.6%) 2.6 (7.2%) 
Availability Percentage of outdoor pedestrian workouts 79% 78% 
longer than 5.75 minutes receiving an 
estimate 
Percentage of participants who completed at 93% 93% 


least 10 outdoor pedestrian workouts longer 
than 5.75 minutes and received at least one 
estimate in their first 10 workouts 


Discussion 


Assessment of CRF using VO» max has received increasing attention as a means of risk stratification, 

with some advocating for its consideration as a vital sign.5 In practice and despite demonstrated benefits 
in application, objective measurement of CRF using CPET remains infrequent in part due to expense, 
participant burden, and limited acceptance as standard of care across multiple specialties.30 Accurate 
estimation of a user's VO2 max through wearable technology could expand CRF screening to a large 
segment of the population at lower cost, and it may allow for remote monitoring of patients between clinical 
visits in programs such as cardiac rehabilitation. Such accurate and available estimates of VO2 max may 
also be used to guide risk stratification and response to programs designed to reduce risk, such as 
preoperative assessment and rehabilitation." 


The improved algorithm for VO max estimation on Apple Watch described here was designed for and 
validated in a population with reference VO2max covering a wide range of cardio fitness levels, as shown 

in figure 4. Nearly half the study participants were over the age of 55, and approximately 10 percent had 
known coronary artery disease. Racial and ethnic diversity in the study population didn't approximate the 
U.S. population; however, heart rate — a key input to VO2 max estimates on Apple Watch — has been 
shown to be consistently accurate across multiple skin tones in both internal and external studies.32 


Extension of VO max estimates to lower ranges in watchOS 7 combined with estimation outside of 
workouts increases the availability of this metric for individuals with low cardio fitness. Over 90 percent 

of participants with at least one outdoor walk, outdoor run, or hiking workout longer than three minutes and 
tracked using the Workout app received at least one estimate of VO2 max on Apple Watch. Increasing the 
number of outdoor pedestrian workouts will increase the likelihood and accuracy of a VO2 max estimate on 
Apple Watch. 
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VO2 max estimation by Apple Watch is accurate and reliable relative to commonly used methods of 
measuring VO2 max, with an average error of less than 1 MET and an ICC of more than 0.85. The accuracy 
of VO2 max on Apple Watch approaches the accuracy of the reference; submaximal exercise testing 
protocols have previously been measured to have approximately zero mean error and a standard error of 

1 MET.83 In terms of test-retest reliability, VO2 max on Apple Watch has an ICC of 0.87 in the validation data 
compared with 0.75 in submaximal treadmill testing.34 


With the new algorithm, VO2 max estimates for users who take heart rate-limiting medications — such 
as beta blockers and calcium channel blockers — and report this in the Health app on iPhone paired 

to Apple Watch should be more accurate relative to estimates made in previous versions of ¡OS and 
watchOS. The handling of these medications doesn't differentiate dosage, cardioselectivity, or intrinsic 
sympathomimetic activity of some beta blockers, all of which could potentially be meaningful inputs but 
have been omitted in favor of usability. With this approach, the estimated error for users in the validation 
cohort who take beta blockers and calcium channel blockers decreased from 11.8 + 4.0 ml/kg/min to 

1.6 + 3.1 ml/kg/min when the Health Details settings appropriately reflected their medication usage. Users 
who take heart rate-limiting medications and don't enter this information will receive higher-than-actual 
estimates; those who take low or as-needed doses of these medications that don't consistently reduce 
max heart rate (for example, propranolol for performance anxiety) will likely receive more accurate 
estimates if they don't enter this information. Given the highly prevalent usage of these medications, 35 
accounting for them appropriately is critical to accurately estimating VO2 max, especially for older users. 


In some conditions, a user's VO max estimate may be inaccurate. Users with an incorrect age, sex, 

or weight entered in the Health app may have consistently inaccurate VO2 max estimates. Normal 
physiological changes associated with pregnancy may lead to inaccurate estimates. Individual estimates 
may be low if sensor data is recorded during behaviors that increase the user's work in ways that 

Apple Watch can't accurately detect. Common instances of such behaviors include carrying significant 
weight beyond body weight, such as a heavy backpack or child, and walking or running on ground that 
increases the user's work, such as sand. Similarly, using an assistive device or pushing a stroller may 
decrease availability or accuracy of VO2 max estimates on Apple Watch. Factors that increase heart rate, 
such as dehydration, caffeine intake, extreme heat, or recent transition to high altitudes may also lead to 
underestimates. VO2 max accuracy on Apple Watch can be increased by performing frequent outdoor 
pedestrian workouts, by achieving higher exertion during workouts, and by wearing Apple Watch 
consistently throughout the day beyond typical workout sessions. 


Users with chronotropic incompetence — a condition where heart rate doesn't appropriately increase 
to compensate for demand38 — may receive overestimates of VO2 max. Chronotropic incompetence is 
primarily associated with heart failure, which occurs in approximately 30 to 80 percent (depending on 
diagnostic criteria) of patients with the condition.3” It has also been linked to significant proportions 

of patients with chronic obstructive pulmonary disease (COPD),38 lupus,39 and other autoimmune 
conditions.4° 


In addition to chronotropic incompetence, other medical conditions can also decrease the accuracy of VO2 
max estimates on Apple Watch. These include medical conditions or devices that decouple heart rate from 
movement or exercise (for example, pain, arrhythmias, pacemakers, or cardiac-assist devices); medical 
conditions that severely limit exercise tolerance, preventing patients from reaching heart rates close to 
their predicted maximum heart rate (for example, peripheral arterial disease); and medical conditions that 
significantly increase the difficulty of ambulation, such as skeletal or neuromuscular conditions causing gait 
inefficiency (for example, multiple sclerosis or cerebral palsy). 
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Conclusions 


With watchOS 7 on Apple Watch Series 3 and later, VO max estimates have been expanded to lower 
ranges of cardio fitness while providing users with the option to receive a notification if their cardio fitness 
level is low for their age and sex. This extended range, along with increased estimate availability and 

the option for users taking heart rate-limiting medications to receive more accurate estimates than were 
previously available, may increase researchers’ and clinicians’ ability to use this metric for tracking fitness 
in older adults and in the presence of comorbid health conditions. 
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